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04 , Abstract. We study a generalization of the classic paging problem that 

5-H ' allows the amount of available memory to vary over time - capturing a 

Q,' 

fundamental property of many modern computing realities, from cloud 



computing to multi-core and energy-optimized processors. It turns out 
that good performance in the "classic" case provides no performance 
guarantees when memory capacity fluctuates: roughly speaking, mov- 
ing from static to dynamic capacity can mean the difference between 
optimality within a factor 2 in space and time, and suboptimality by 
an arbitrarily large factor. More precisely, adopting the competitive 
analysis framework, we show that some online paging algorithms, de- 
spite having an optimal (h, k) — competitive ratio when capacity remains 
constant, are not (3, k) — competitive for any arbitrarily large k in the 
presence of minimal capacity fluctuations. 

D""- In this light it is surprising that several classic paging algorithms 

perform remarkably well even if memory capacity changes adversarially 
- even without taking those changes into explicit account! In partic- 

—j. ' ular, we prove that LFD still achieves the minimum number of faults, 

c^J ■ and that several classic online algorithms such as LRU have a "dynamic" 

(h, k)— competitive ratio that is the best one can achieve without knowl- 
edge of future page requests, even if one had perfect knowledge of future 
capacity fluctuations (an exact characterization of this ratio shows it is 

J^ , almost, albeit not quite, equal to the "classic" ratio k _ k h+1 ). In other 

words, with careful management, knowing/predicting future memory re- 
sources appears far less crucial to performance than knowing/predicting 
future data accesses. 
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1. Introduction 

This article examines a generalization of the classic paging problem that 
allows the amount of available memory to vary over time. After briefly 
reviewing the paging problem (subsection II. 1|) this section motivates paging 
with dynamic capacity (subsection I1.2J) and provides an overview of our 
results and of the organization of the rest of the article (subsection II .3[) . 

1.1. The paging problem. The memory/data storage system of modern 
computing devices is almost always organized as a hierarchy of several layers 
of progressively larger capacity but also higher access cost (in terms of both 
time and energy) . The ever widening gap in both capacity and cost between 
different layers makes the paging problem, i.e. the problem of efficiently 
orchestrating the flow of information across the memory hierarchy, crucial 
to the performance of a computing device. The most widely used theoretical 
model for studying paging is that of a two-layer system: a smaller memory 
layer with a capacity of k pages (data blocks), and a larger layer of infinite 
capacity whose pages can only be accessed by first copying them into memory 
- an operation usually termed a (page) fault. Given any sequence of pages 
that must be accessed in order, an algorithm for the paging problem must 
choose which page(s) to "evict" from memory, whenever a new page must 
be copied into it, so as to minimize the total number of faults. 

The simple algorithm LFD (Longest Forward Distance) that evicts the 
page accessed furthest in the future has long been known to be optimal [4J. 
However, paging is often studied as an online problem, i.e. an algorithm 
can decide evictions only on the basis of past requests. One very popular 
framework for evaluating the performance of online paging algorithms is 
that of competitive analysis [18J. A paging algorithm is said to have an 
(h,k) — competitive ratio of (no more than) p if, for every request sequence, 
it incurs in expectation with a memory of capacity k at most p times as 
many faults as an optimal offline algorithm incurs with a memory of capacity 
h < k, plus a number of faults independent of the request sequence. The 
ratio ^ is called the resource augmentation. Resource augmentation and 
competitive ratio capture, respectively, the space and access cost overheads 
incurred by an online algorithm. 
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Many simple, deterministic algorithms including LRTJJ, FIFQj, FWFu 
and CLOCKQ have an (h, /c)-competitive ratio of k _ h+ i |26(, [6]; and the 
same ratio holds for RANE|j [25] • This ratio is optimal for deterministic al- 
gorithms, and even for randomized ones if page requests can depend on pre- 
vious choices of the paging algorithm (the "adaptive adversary" model [25] 
which we adopt throughout this articlqj)- 

Since fc _ fc / 2+1 < 2, many simple online algorithms never fare worse than 
the optimal offline algorithm would on a memory system with half the capac- 
ity and twice the access cost. This justifies the use of competitive analysis for 
preliminary performance evaluation of paging algorithms. Its "worst-case" 
approach may be somewhat pessimistic, but it is not overly so for many 
popular online paging algorithms - for which it provides guarantees of per- 
formance within a factor 2 of the optimal under any workload (in terms 
of faults and required memory capacity). In contrast, the finer granular- 
ity evaluation provided by experimental benchmarking is inevitably tied to 
specific workloads. 

[6], [10] and [8] provide three excellent surveys of the many variants of 
competitive analysis for the paging problem: these include somehow limiting 
the choice of the adversarial request sequence [12j [IHJ [7J [1] , amortizing the 
performance evaluation over a spectrum of sequences [U O [3] or of memory 
capacities [29] , considering pages of different size and access cost [HI ES] , 
and accounting for the non-zero cost of non- fault requests [27j . 

1.2. Paging with dynamic capacity. Throughout the long history and 
the many variants of the paging problem, memory capacity has generally 
been assumed to remain fixed throughout the request sequence. This no 
longer reflects many important computing realities. 



Least Recently Used - evict the least recently accessed page 

o 

First In First Out - evict the page brought least recently into memory. 

Flush When Full - evict all pages whenever memory is full and space is needed. 

Mark any page accessed; to evict a page, cycle through pages, unmarking those found 
marked and evicting the first found unmarked. 
5 Evict a page chosen uniformly at random. 

More precisely, in the online adaptive adversary model, the choices of the reference 
offline algorithm can depend only on the past random choices of the online algorithm; 
in the offline adaptive adversary model they can depend on the random choices of the 
online algorithm over the entire request sequence. The bounds above - and in fact all the 
bounds in this article - hold for both models, with one exception: the upper bound on 
the competitive ratio of RAND above, and the corresponding upper bound we provide for 
RAND in theorem O only hold in the adaptive online model. 
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In a cloud computing environment, the amount of physical memory avail- 
able to an individual virtual machine does vary considerably over time de- 
pending on the virtual machine's load and on the number, load and relative 
class of service of other virtual machines hosted on the same hardware. Even 
on a simple PC, most modern operating systems have and use the option 
of declaring some critical virtual pages temporarily "unswappable" , pinning 
them in main memory and thus reducing the amount of main memory avail- 
able to user processes. 

Memory fluctuations also take place when considering the cache-RAM 
interface - in which case memory represents cache memory and pages rep- 
resent cache lines. In many multi-core processor designs cache capacity is 
partitioned dynamically between different cores [21] . And low-power chip 
designs can often dynamically disable underutilized portions of the cache to 
save energy [16] . again resulting in a capacity that can vary over time. 

This article studies an extension of the classic paging problem that ad- 
dresses these issues allowing memory capacity to fluctuate between 1 and k 
pages (instead of being constantly equal to k). These fluctuations may be 
either known beforehand to the paging algorithm (an "offline" problem), or 
unknown until they take place (an "online" problem). Note that, although 
there exists a large body of work on servicing the same request sequence 
with a policy that is simultaneously "good" on memories of different [2U] 
and perhaps unknown [13] but unchanging capacity, allowing capacity to 
vary dynamically during the course of the computation is an entirely differ- 
ent problem; as we shall see, a solution to the former does not guarantee 
even an approximate solution to the latter. 

[9| [2T| recently introduced the related, but fundamentally different, prob- 
lem of RAM rental. In RAM rental memory capacity can fluctuate under 
control of the paging algorithm, and the goal is to minimize a linear com- 
bination of average capacity and fault rate over time. In practice there 
are usually very strong constraints on the set of admissible capacity values, 
on how they can change over time, and on their relative costs (which may 
themselves fluctuate). Also, a number of architectural approaches (e.g. [TT] ) 
decouple the portion of the system responsible for page replacement from 
that responsible for capacity allocation. Then, assuming as we do that ca- 
pacity fluctuations are not controlled by the paging algorithm (in fact, that 
they may be unknown beforehand and even chosen adversarially) leads to 
a more robust evaluation of page replacement policies. As we shall see, it 
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turns out that page replacement can be decoupled well from capacity con- 
trol, yielding robust replacement policies that do not depend on the capacity 
choices or costs, while at the same time simplifying the RAM rental problem 
(which is still widely open - e.g. little is known in terms of lower bounds or 
resource augmentation). 

1.3. Our results. The rest of this article is organized as follows. Section [2] 
introduces some formalism and terminology. In particular, it extends the 
notion of "online vs. offline" problem to encompass the extra dimension 
of future memory capacity, and it extends the notion of (h, k) — resource 
augmentation to the dynamic capacity scenario (in a nutshell, restricting the 
offline algorithm to at most a fraction | of the online algorithm's current 
memory capacity). 

Section [3] shows the existence of online paging algorithms that have an 
(optimal) (h, /^-competitive ratio of k _ h+l in the "classic" paging model, 
and yet are no longer (3, /c)— competitive for any arbitrarily large k if their 
memory capacity is subject to single page fluctuations. This very nega- 
tive result provides strong justification for our inquiry, as one cannot infer 
performance in the presence of (even minimal) memory fluctuations from 
performance in their absence. 

In this light, it is quite surprising that many well-known algorithms per- 
form remarkably well in the presence of memory fluctuations even if those 
fluctuations are chosen adversarially. In section H] we show that the classic 
LFD algorithm remains optimal for all possible memory capacity fluctua- 
tions even though it does not explicitly take those fluctuations into account 
(i.e. it is an online algorithm in terms of memory fluctuations). We also 
show that in the dynamic capacity framework every online algorithm that 
is either marking [12] like LRU, FWF or MARKLj, or dynamically conser- 
vative (a simple refinement of the notion of "conservative algorithm" |28j), 
like LRU, CLOCK or FIFO, has an (h, k)— competitive ratio no larger than 

PEL{h,k) = max / fc/<fc k'tN ; — b — TT7- Exactly the same bound holds for 

RAND (against an online adaptive adversary). 

Section [5] analyses pEL,(h,k). We show that it is a lower bound to the 
(h, k)— competitive ratio achievable by any online paging algorithm in the 



Mark any page accessed; evict a random unmarked page, first unmarking all pages if all 
are marked. 
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presence of memory fluctuations, proving the optimality of marking and dy- 
namically conservative algorithms. We also show that pEL(h, k) almost, but 
not quite, matches the "classic" bound of k _ h+i on the (h, ft)— competitive 
ratio. More precisely, pEi{h,k) is at least 1 + (^ — p-) times as large as 
k _ h+ i for any odd h and ft = 2h, but it is always less than 1 + ^ times as 
large - and if h > ft — \fk the two quantities actually coincide. 

Section [6] briefly looks at the implications of our results for the RAM 
rental problem. In a nutshell, since many simple replacement are near op- 
timal regardless of capacity fluctuations, RAM rental is simplified into the 
problem of just choosing a "good" capacity sequence without worrying about 
replacement. 

Finally, section [7] summarizes our results and looks at their significance 
and at possible directions of future work. 

2. Some formalism/terminology 

We can easily extend the notion of request sequence a = t\ , Ti , . . . to 
the case of memory fluctuations. We simply assume that, interleaved with 
standard page requests, it is possible to have two additional types of requests, 
growths and shrinks. On a growth, memory capacity increases by 1 page; 
on a shrink, it decreases by 1 - and if the memory was full a page must be 
evicted. We assume that initially memory capacity is 0. Throughout the 
rest of this article, we denote a growth request by the symbol + and a shrink 
request by the symbol — , and we denote k consecutive growths / shrinks 
by + fc and — k . Thus, a standard request sequence p%, . . . ,p n on a memory 
of capacity k simply becomes + ,pi, . . . ,p n in the more general dynamic 
capacity framework. 

The request sequence automatically induces a page sequence % =< px,P2, • • • > 
(the sequence of requested pages p\ , pi , . . . , as in the classic paging problem) 
and a capacity sequence \i = mi,m2, ■ ■ ■ where mi is the memory capacity 
immediately before the request for pi (i.e. it is equal to the number of +s 
minus the number of — s in the request prefix ending with pi). Note that the 
presence of growths and shrinks introduces a second aspect of "onlineness" . 
More formally: 

Definition 1. A paging algorithm ALG is online relative to the page se- 
quence if its eviction choices before servicing a request are independent of 
any future page requests; otherwise it is offline relative to the page sequence. 
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Similarly, ALG is online relative to the capacity sequence if its eviction 
choices before servicing a request are independent of any subsequent growths 
and shrinks; otherwise it is offline relative to the capacity sequence. ALG 
is a fully online, partially offline and fully offline paging algorithm if it is 
online relative to (respectively) both, one, or neither of the page and the 
capacity sequence. 



Thus, in the dynamic capacity model, all well-known paging algorithms 
such as LRU, FIFO, FWF, CLOCK, RAND and MARK are fully online, 
and LFD is partially offline, being offline relative to the page sequence even 
though it is online relative to the capacity sequence. 

We can easily extend the notion of (h, k) — competitive ratio to the dy- 
namic capacity model by comparing the cost (i.e. number of faults) incurred 
by an online algorithm whose memory capacity never exceeds k to the cost 
incurred by an offline algorithm whose memory capacity never exceeds jr 
times that of the online algorithm. More formally, denote by OPT the opti- 
mal offline algorithm, and by calg( 7F )M) the cost incurred by an algorithm 
ALG when servicing a page sequence n = pi, . . . ,p n with a capacity sequence 
H = mi, . . . , m n . Also, given a capacity sequence p = mi, . . . , m n and a non- 
negative number a, denote by \_a ■ p\ the capacity sequence m'-y, . . . , m' n with 
m'j = [a ■ mi\ ■ Then: 



Definition 2. A paging algorithm ALG has a dynamic (h,k) — competitive 
ratio of (at most) p if there exists some constant d such that, for any page 
sequence it = pi, ■ ■ ■ ,p n and any capacity sequence a = mi, . . . , m n such 
that, \/i, rrii < k: 

calg{k, h) < p ■ cqpt(tt, It ■ /"J ) + d 



Note that the dynamic (h, k)— competitive ratio of an algorithm is always 
an upper bound to its {h, k)— competitive ratio. Thus online paging with 
dynamic capacity is in some sense "harder" than classic online paging, and 
no online algorithm can have a dynamic (h, k)— competitive ratio lower than 
the "classic" ratio k _ h+l ■ 
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3. Minimal capacity fluctuations can lead to arbitrarily 
large performance degradation 

This section shows that there exist online paging algorithms that do not 
depend explicitly on memory capacity, and that have an optimal (h, k)- 
competitive ratio in the classic setting of fixed memory capacity, but are 
not competitive at all, even with arbitrary resource augmentation, when 
faced with even slight fluctuations in memory capacity. Consider the online 
paging algorithm LFRU (Least Frequently / Recently Used) that starts as 
LRU and then alternates between LFU and LRU - switching from LRU to 
LFU after any palindrome subsequence incurring more faults in its second 
half, and switching from LFU to LRU after any palindrome subsequence 
incurring more faults in its first half: 

Algorithm 1 LFRU: service po, . . . ,p n as follows 
at po POLICY <- LRU 

for i = 1 ... n do 

if at p l POLICY = LRU AND 3j < i: 

< Pj . . .pi > is palindrome AND faults(pj . . .p, i+± ,) < faults(pr«+j-i . . .pi) 
then at p i+1 POLICY <- LFU 

else if at p z POLICY = LFU AND 3j < i: 

< Pj . . .pi > is palindrome AND faults(pj . . .p, «+,? .) > faultsfjOr.+j-i . . .pi) 
then at p l+1 POLICY <- LRU 

end for 

We would convince the reader that LFRU, while somewhat artificial and 
difficult to implement in practice, is not too different from many real-world 
paging heuristics designed for static memory capacity (note that the be- 
haviour of LFRU, like that of LRU and LFU, does not depend explicitly on 
memory capacity). In fact, pure LRU tends to be outperformed in practice 
by various LRU/LFU hybrids |20|, 122] . The reason for this is the regrettably 
common coexistence of "local" or "temporal" computations, exhibiting a 
high degree of temporal locality and data reuse, with "streaming" computa- 
tions that access long sequences of sequential data with no temporal locality 
at all. In such cases, under LRU and similar policies such as CLOCK, 
streaming data not only gain no benefit from the use of a cache (since every 
new access is a fault) but pollute the cache, forcing the eviction of temporal 
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data and preventing the temporal computation from deriving more than a 
minimal benefit from the cache. One possible solution is to combine LRU 
with eviction schemes biased, like LFU, against data that have no reuse 
history even if their last (and only) access was very recent. And since LRU 
performs best when future requests are a "mirror image" of the past, it 
may seem reasonable to switch to it when such palindrome sequences ex- 
hibit good caching behaviour, and switch to LFU when such palindrome 
sequences exhibit poor caching behaviour - which is exactly what LFRU 
does. 

It turns out that LFRU has an optimal (h, k)— competitive ratio in the 
classic paging model where memory capacity is fixed. At the same time, 
even if faced with capacity fluctuations of just a single page, and even if 
allowed the use of an arbitrarily large amount of memory, LFRU's fault rate 
can be arbitrarily larger than that of an offline algorithm running with just 
3 pages of memory. More formally we prove: 

Theorem 1. LFRU has an (h,k) — competitive ratio equal to k _ h+l if mem- 
ory capacity is constant, but has no finite dynamic (h,k) — competitive ratio 
for any h > 3 and any arbitrarily large k. 

Proof. Let us first prove that LFRU has an (h, k)— competitive ratio equal to 
k _ h+l if memory maintains an arbitrary but fixed capacity k. We need only 
prove that, as long as LFRU keeps behaving as LRU, on no page request 
sequence a palindrome subsequence incurs more faults in its second half: 
then LFRU keeps behaving exactly as LRU and shares its (h, fe)-competitive 
ratio of F rf+ T . 

Consider a palindrome page subsequence tt = pi x , . . . ,Pi e ,Pi e , ■ ■ ■ ,Pi ± of 
even length 2£, containing A < £ distinct pages p±,...,p\. Note that, if 
A < k, by the end of the first half of n the A most recently requested pages 
are pi, . . . ,p\, which are then in memory and prevent any fault from taking 
place during the second half of tt. Then, we need only consider the case 
A> k. 

Let us focus on the first half of tt. For each distinct page, we analyse 
separately the first request to it (which we call a cold request), and the 
remaining requests, if any (which we call hot requests). The i cold request 
is certainly a fault for any i > k, since at least k distinct pages have been 
requested before it in p^ , . . . , pi e ; so the number of cold requests incurring 
faults is at least £ — k. Let us now look at hot requests, and let ri be the 
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number of hot requests of pi in the first half of n. For 1 < i < A and 
1 < J ' < Ti, let D\ be the set of distinct pages requested between the j th 
hot request for pi and the previous request for pi, inclusive (so Dj always 
includes pi). Then the j th hot request to pi is a fault if and only if \D? | > k, 
and the total number of faults in p^, . . . ,pi e is: 

(1) fl>(£-k) + \{(i,j):\D{\>k}\ 

Let us now focus on the second half of ir. Again, we divide requests for 
any distinct page into a cold request (the first) and hot requests (subsequent 
ones, if any). The first k cold requests ofpi e , . . . ,pi x are for the last k distinct 
pages requested in p^ , . . . , pi e , which are then present in memory at the 
beginning of Pi t , . . . , Pi x . So in pi e , . . . , pi x none of the first k cold requests 
incurs a fault, yielding and at most X — k faults on cold requests. Let us 
now look at the hot requests of Pi e , . . . ,pi x \ those for pi are obviously r^, as 
in the first half of n. For 1 < i < A and 1 < j < n, let .£>*• be set of distinct 
pages between the j th hot request for pi and its previous request, including 
Pi itself; then the j th hot request for pi is a fault if and only if \Dj | > k, and 
the total number of faults in pi n . . . ,pi x is: 

(2) fl<(£-k) + \{(i,j):\D{\>k}\ 

It is crucial to observe that, since n is palindrome, D % - = D l r ,_- +1 . Then 

\{{i,j) : \D>\ > k}\ = \{(i,j) : \Dj\ > k}\ and /| > /J. The analysis is 
virtually identical for palindrome subsequences of odd length; and thus with 
static memory capacity LFRU incurs no more faults on the second half of any 
palindrome subsequence than in the first half and has an (h, A;)— competitive 
ratio equal to .£,-, . 

To prove that LFRU can incur arbitrarily more faults than an optimal 
offline algorithm OPT when memory capacity fluctuates - even if OPT is 
limited to a capacity fluctuating between capacity 3 and 2, while LFRU's 
fluctuates between 3m and 2>m — 1 for an arbitrarily large m - we show how 
LFRU can be coaxed into, and kept in, LFU behaviour, and how that be- 
haviour can result in arbitrarily more faults than OPT even with arbitrarily 
larger capacity. 
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Denote by p e the subsequence consisting of £ consecutive requests for p, 
and consider the page sequence: 

7T1 =< Pl,P2,pi ■ ■ ■ ,pim,Pl,P2,P3, ■ ■ ■ ,P3m,P3m, ■ ■ ■ ,P2,Pl > 

with LFRU's memory capacity fixed at 3m except for the last 3m requests, 
during which it drops by 1 to 3m. — 1. It is immediate to see that when 
capacity drops p\ is evicted, and that the last 6m requests form a palindrome 
subsequence experiencing a fault (only) on the last request. Thus, on the 
last request of n, LFRU switches to LFU behaviour, and evicts p2 (which, 
like pi, has experienced £—1 fewer requests than every other page pi, i > 3). 
If the subsequence of requests n' = (p2,PiY 1 follows, with memory capacity 
remaining fixed at 3m. — 1, LFRU keeps evicting in turn p\ and p2, which 
remain the two pages having experienced the fewest requests; and LFRU 
incurs at least 2£ — 2 faults. 

An optimal algorithm (or even just LRU) with memory capacity 3 through- 
out all but the last 3m requests of n, and with memory capacity 2 thereafter, 
would instead incur no more than 3m + 3m + 3m = 9m faults during it, and 
no faults at all during n' (retaining only p\ and p2 in memory). Thus, since 
£ can be chosen arbitrarily larger than m, LFRU does not have a finite 
(3, 3m,)— competitive ratio for any arbitrarily large m. □ 



4. Even adversarial fluctuations can be addressed efficiently 

(and "implicitly") 

In the light of theorem Q] it may be somewhat surprising many well-known 
"good" paging algorithms still perform remarkably well in the dynamic ca- 
pacity setting - even though they do not take memory fluctuations into 
explicit account. It is very easy to prove: 

Theorem 2. LFD incurs the minimal number of faults on any request se- 
quence. 

Proof. We can safely ignore algorithms leaving unoccupied space in memory 
after an eviction, as such an eviction could be delayed without incurring 
additional faults. Let a page be close if it will be accessed before another 
page currently in memory, far otherwise. LFD is the algorithm evicting no 
close pages. We prove the theorem showing that one can always eliminate 
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the earliest close eviction without altering previous evictions or increasing 
the number of faults. 

Let p be the close page evicted earliest, at time t, by an algorithm ALG 



servicing a request sequence. Consider the algorithm ALG that operates 
as ALG until t, when it instead evicts a far page p, and then operates as 
follows. Denote by M and M the sets of pages respectively in ALCs and 
ALG } s memory. When both ALG and ALG must incur an eviction, ALG 
evicts the same page as ALG if possible; otherwise, when ALG must incur 
an eviction, it evicts a page not in M (as soon as M = M, ALG and ALG 
coincide). After t, let if be the time of the first request or eviction of either 
p or p. Until if ALG and ALG incur exactly the same faults and evictions, 
and thus M\~M = {p] and ~M\M = {p}. At t' ALG evicts p if and only if 
ALG evicts p - in which case M and M converge. Otherwise p is requested 
at if and ALG, but not ALG, incurs a fault; and since ALG never evicts a 
page unless ALG also has evicted it, \M \M\ never increases after t, and 
drops to no later than the first fault incurred by ALG and not by ALG. 
In both cases ALG incurs no more misses than ALG. □ 

It is interesting to note that theorem [2] yields as an immediate corollary 
theorem 4.1 in |15j - in a nutshell, for a given, dynamically changing, par- 
tition of the memory space between different processes, using LFD for each 
process on its own partition yields the minimum total number of faults. It 
is not, however, immediately obvious that the result in [15] implies our the- 
orem [2j Furthermore, the result in |15j is only stated, and not proved - the 
proof is deferred to the full version of the article because of its complexity 
compared to the "classic" proof of LFD's optimality. 

Let us now focus on online paging algorithms. It turns out that the dynamic 
(h, k)— competitive ratio achievable by many well-known online algorithms 
is almost, but not quite, as good as the "plain" (h, k) — competitive ratio 
k _ h+1 - and in particular equal to: 

k' 

PEL(h,k) = max 



k'<k,k'eAf k> - L/if - f J 
The formula for pel is more complex, but vaguely reminiscent of the 
formula for the "classic" (h, fe)-competitive ratio; and indeed it is easy 
to verify that for h = k both equal k. A detailed analysis of the be- 
haviour of pel, including a proof that it is a lower bound on the dynamic 
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(h, k)— competitive ratio of any online algorithm, can be found in the fol- 
lowing section \E\ The remainder of this section is devoted to proving that a 
dynamic (h, k)— competitive ratio pEL(h,k) is indeed achieved by all mark- 
ing algorithm^ (including MARK, LRU and FWF), by RAND, and by all 
dynamically conservative algorithms. The latter form a class of algorithms 
that is slightly narrower than that of conservative algorithmgj[28j but still 
includes LRU, FIFO and CLOCK. The cornerstone of the analysis lies in the 
notion of short subsequence, which is the "correct" extension of the concept 
of k— phase to dynamic capacity: 

Definition 3. Given a generic (sub)sequence of consecutive requests, its 
width is the number of distinct pages in it. 

Definition 4. Consider a generic request sequence a, and a subsequence a' 
of consecutive requests in a (including page requests, growths and shrinks), 
a' is short if, for every prefix n of a' , the width of tt does not exceed the 
memory capacity at the end of it. 

Definition 5. A dynamically conservative algorithm never incurs more than 
w faults on any short subsequence of width w. 

Note that a dynamically conservative algorithm is also always a conser- 
vative algorithm according to the definition of [28] since with a memory of 
fixed capacity k every short subsequence involves access to at most k pages, 
and thus incurs at most k faults. The reverse is not true: LFRU from sec- 
tion [3] is conservative but not dynamically conservative. However, we can 
easily prove: 

Theorem 3. LRU, FIFO and CLOCK are dynamically conservative. 

Proof. It is not difficult to verify that all three algorithms have the following 
property: if a page p is brought into memory at time t, and a page p' 
already in memory at time t and is never accessed again, then p' will be 
evicted before p. This holds for LRU because p is more recently accessed 
than p' . It holds for FIFO because p' entered the memory before p. It 
holds for CLOCK because after t the unmark/evict process will encounter 



A marking algorithm marks a page in memory whenever it accesses it, never evicts a 
marked page, and unmarks all pages if all are marked and one must be evicted (e.g. in 
response to a fault or a shrink). 

A conservative algorithm never incurs more than k faults on a sequence of accesses 
involving at most k distinct pages and a memory of capacity k. 
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p' before encountering p - thus either evicting or at least unmarking p' before 
unmarking p, and thus certainly evicting it before evicting p. Then none of 
the three algorithms evicts a page accessed during a short sequence before 
the end of the sequence (since there is always sufficient memory to hold all 
pages accessed during the sequence), and thus none can incur more faults 
than the width of the sequence. □ 

The main result of this section is then: 

Theorem 4. The dynamic (h,k) — competitive ratio of any online paging 
algorithm that is either marking or dynamically conservative is no larger 
than pEL(h, k). 

Proof. Let us begin with marking algorithms. The proof bears some re- 
semblance to that of the static case, with a number of subtle but profound 
differences. One such difference is that, instead of partitioning the request 
sequence into maximal length phases each involving access to k distinct 
pages, we partition it into maximal short sequences ir\, . . . ,ir n where Hi is 
the longest short sequence beginning immediately after the end of 7Tj_i. 

Denote by Wi the width of m. We can assume without loss of generality 
that the request sequence ends with a page request, so Wi > Vi. Note that, 
for i > 1, the first request of 7Tj must be either a shrink or a request for a 
page not in 7Tj_i; in the first case we say that 7Tj_i is capacity bound, in the 
second that it is page bound. 

It is easy to verify by simultaneous induction the following claims hold 
for all i: 

(1) All pages in memory are unmarked when 71^1 is serviced. 

(2) Every one of the wi pages accessed during 7Tj (and no other page) 
remains marked and thus in memory until the end of 7Tj. 

(3) Immediately before 7Tj+i 1 is serviced, the memory is full and holds 
wi pages, all marked. 

Claim 1 holds trivially for i = 1. If Claim 1 holds for i, Claim 2 also holds 
for i, since until the end of 7Tj the memory is large enough to accommodate all 
pages accessed so far during 7Tj, which are the only ones marked. If Claim 
2 holds for i, Claim 3 also holds for i, since immediately before 7Tj+i 1 is 
serviced the memory capacity exactly matches the number of distinct pages 
accessed in m. If Claim 3 holds for i, Claim 1 holds for i + 1 (proving the 
inductive step), since the first request of 7Tj + i must be either a shrink or a 
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request for a page not in 7Tj, and thus causes all pages in memory to become 
unmarked. 

From Claim 2 it is obvious that a marking algorithm incurs a number of 
faults at most equal to Wi during short sequence in, for a total number of 
faults equal to at most: 



(3) c ALG < J2 






Let us compute the number of faults incurred by any other algorithm ALG 
with a memory of capacity at most r times that of the marking algorithm, 
in the interval ~k\ from immediately after the first request a of 7Tj is serviced, 
to immediately after the first request of 7Tj + i is serviced or to the end of 
the request sequence if i = n. Let n be equal to 1 if 7Tj is page bound, 
and to if it is capacity bound, for 1 < i < n, and let tq = and r n = 0. 
Remember that the first request of a short phase 7Tj is a shrink if 7Tj_i is 
capacity bound, and a page not in 7Tj_i if 7Tj_i is page bound - and a growth 
if i = 1. Denoting by w[ the number of distinct pages in n^ after removing 
the page involved in the first request of 7Tj if any (i.e. if 7Tj_i is page bound), 
we can then write for 1 < i < n: 

(4) w'i = -Ti-l + Wi+Ti 



The subset of these pages in the memory of ALG immediately before 
servicing the first request of -k\ is then at most: 



if i = 1, 

|£(u>i_i-l)J ifi>l. 



(5) m, 

> 

Equation [5] is immediate if i = 1 or if 7Tj_i is capacity bound - since then 
the first request of 7Tj shrinks the memory available to ALG from w^i to 
Wj_i — 1. If instead 7Tj_i is page bound, of the L^^i-iJ pages ALG's memory 
can hold, one must be the first page of 7Tj that has just been requested and 
that does not contribute to rrii - leaving only [^Wj_iJ — 1 < [^(^j-i — 1)J- 
Then the total number of faults incurred by ALG is at least: 
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£K 



mi) 

n 



{ ' i=l i=2 

^I>*-l£(«*-i)J) 

4 = 1 

Remembering that both W{ and Wj — \\Ub% — 1)J with h < k are positive, 
and that Vo, b, c, d > we have that ^ = ^.| + ^.^< max(§, j|), 
the dynamic (/j, fe)— competitive ratio of j4LG is at most: 






Er=iK-L|K-i)J) 



(7) 



< max 



(('; 



< max 



^-LfK-i)J 
k' 



k'e{i,...,k} k> - [h% - * j 

This proves the theorem for marking algorithms. The proof for dynam- 
ically conservative algorithms proceeds identically, except for the fact that 
in this case one can immediately obtain, from definition [5j the bound given 
by Equation [3] on the cost incurred by the online algorithm. □ 

The proof of theorem 2] is vaguely reminescent of that for marking and 
conservative algorithms in "classic" paging, but is considerably more com- 
plex: for example, the strategy of analysing each short subsequence in iso- 
lation does not work, and one can only bound the ratio over the whole 
sequence, through careful accounting and a potential argument. The anal- 
ysis of RAND faces similar difficulties in terms of "compartimentalization 
of costs" ; but they are addressed in a different way due to the randomized 
nature of the algorithm (by exploiting its lack of memory). In this sense 
it may be somewhat surprising that exactly the same bound obtained in 
theorem S] also applies to RAND: 

Theorem 5. RAND's dynamic (h,k)— competitive ratio is no larger than 
PEl{h,k) in the adaptive online adversary model. 

Proof. While the proof bears a few similarities to the analysis in the static 
case, it requires subtlety and a somewhat different approach due to the 
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possible fluctuations of memory capacity - and in particular to the fact 
that, if h < k, cache shrinks may not be "synchronized" and RAND may 
incur shrinks when the optimal offline algorithm OPT does not. Instead 
of comparing the number of faults grand and copt incurred, respectively, 
by RAND and OPT, we then begin by comparing the number of evictions 
grand and eopT- For simplicity, assume that, after any given request (for a 
page, or for a capacity change), the request is served in the following order. 
OPT performs any eviction(s); then it loads into memory any requested page 
not yet there; then RAND does the same; finally, OPT adjusts its memory 
capacity, and then RAND does the same. 

Let the garbage of RAND at any given point in time be the set G of pages 
in its memory and not in the memory of OPT. First of all, note that G can 
increase only when OPT incurs an eviction (and at most by 1 page for each 
eviction), since RAND never brings into memory a page not requested by 
OPT - which at that point must then be in OPT's memory. 

Immediately before RAND incurs an eviction, its memory must be full; 
denote by k' and h! the memory capacity of RAND and OPT at that point. 
If the eviction is the result of a shrink, then the number of pages in RAND's 
memory that are not garbage are at most: 

(8) h! = l^(k> - l)j 

Note that at this point OPT has adjusted its memory capacity to the 
shrink but RAND has not. If the eviction is the result of a page fault, then 
the requested page at this point is in OPT's memory but not in RAND's, 
and the number of pages in RAND's memory that are not garbage are at 
most: 



(9) ti-l = \±V\-l<\±{V-l)\ 

Thus the probability that, when RAND incurs an eviction, |G| decreases 
by 1 is at least: 



(10) PRAND = min 



k'-[^k'-l)\ 



k'e{i,...,k} k' 

and at any given time we have that, in expectation: 

(11) \G\ < e Q PT ~ eRAND ■ PRAND 
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Appending to any request sequence sufficient shrinks to bring RAND's 
memory capacity to obviously brings \G\ to 0, without increasing the 
number of faults incurred by RAND or OPT. For any algorithm that 
evicts a single page at a time, when the memory holds no pages the number 
of faults and evictions incurred must coincide. Setting |G| to 0, as well as 
copt = &opt and crand = crand , in Equation [TT] then yields for RAND 
a dynamic (h, k) — competitive ratio equal to at most: 

(12) C_ M ND^^_ = max // 



COPT PRAND k'e{l,...,k} k' - \h^ - |J 

□ 



5. An exact characterization of the competitive ratio 

The upper bound pEL(h, k) obtained in the previous section H] for the dy- 
namic (h, A;)— competitive ratio of many online paging algorithms is actually 
tight. More formally, we can prove: 

Theorem 6. No online paging algorithm has a dynamic (h,k) — competitive 
ratio (against any online of offline adaptive adversary if randomized) lower 

than: 

k' 
PEL(h,k) =max k , e{1 ^ ik} - — ^ ^ 



Proof. Let k = arg maxj/gh k \ — — -\, — —. and let ALG be a generic on- 

k '~l h —~ki 

line paging algorithm. Consider a request sequence o~ n =< +( fc ~ 1 ),7Ti, ... ,?r n >, 
where: 



(13) n =< +^-~ k+1 \ Phll -<*-*■!>,. . . , +^ k -~ k+1 \p irk , -^'^ > 

and pi j is any one page, from the set pi, ■ ■ ■ ,p^, that is not in ALG's memory 
just before it is requested - note that immediately before any page request 
ALG's memory holds at most k — 1 pages, so there always exists one such 
page. ALG then incurs a fault on every page request, for a total number of 
faults equal to: 



(14) calgKO = n ■ k 
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Consider an offline algorithm ALG with access to a memory that has 
at most jr times the capacity of ALG's at any given time; in particular, 
ALG's memory capacity grows to h immediately before any page request, 
and immediately afterwards drops to capacity: 



(15) l£(*-i)J = L^-£j<* 

ALG can easily maintain in its "permanent" [hj — -r\ memory locations 
the [hi — j; J pages with most expected accesses in a n , incurring for each 
only one initial fault. Note that the total number of accesses to these pages 

is, in expectation, at least nk ■ — k - k = n[h^ — ^J. Every other page, 
when requested, is brought into the "temporary" location(s) immediately 
eliminated by the following shrink. ALG then incurs an expected number 
of faults equal to: 



(16) cjj^anJ^L^-^+nCfc-L^-^J) 

Then the competitive ratio of ALG can be no lower than: 



,. CALGJPn) _,. nk 



-czsfrn) • • L fcj-4J+n(fc-L^-JJ) 

y 

= max n ;— 

k'e{i,...,k] k' - [h\ - |J 



It is important to observe that, if ALG is randomized, ALG need only 
know ALG's probabilistic behaviour to choose which pages to keep in its 
own memory; and it can choose which page to request next based only on 
ALG J s current memory contents. Thus the lower bound we proved holds for 
deterministic and randomized algorithms both in the adaptive offline and in 
the adaptive online adversary models. □ 

As noted in section[3]the expression of the optimal dynamic (h, /^-compet- 
itive ratio pEi{h,k) appears considerably more complex than, but vaguely 
reminiscent of, that of the "classic" bound on the (h, k)— competitive ratio, 
k _ h+1 - It is natural to ask whether the two are actually different, and if so 
to what extent. We show that pEh{h, k) is, in fact, a factor ~ 1 + r larger 
for some "natural" values of h and k - though it is never more than a factor 



20 ENOCH PESERICO 

1 + 75 l ar g er i an d actually coincides with k _ h+l if h is equal or very close to 
fe. This is stated more formally in the following two theorems: 

Theorem 7. For any odd h and k = 2h, pEh{h, k) > (1 + ^ — A) fc _^ +1 . 

Proof. For any integer i > 0, choosing h = 2i + 1, k = 2h, and fe' = fe — 1, 
we obtain immediately: 

4i + 1 Ai + 1 fe - 1 



(18) 



jfc-l |+1 fe 



| fe fe-/t + l 



k k 2 k-h + 1 



□ 



Theorem 8. fc _^ +1 < pEL(h,k) < (1 + %) fc _fc +1 /or a// /i, fe G Z + wi/t 
h < k, and PEL{h, fe) = fc _^ +1 if k > h > k — \fk. 

Proof. It is immediate to verify that, for fe' = fe: 

(19) p BL (/i, fe) > * j- = 7-4x7 

And since, if fe' < /i, we have that: 

fe' fe' 

< 



(2°) 



k'-lh k i-h-k'-(h k i-^ k ) 



v < 



then values of fe' < h can be disregarded in the max operation. To prove 
that, for all h < fe, pEL{h, fe) < (1 + k)~k~h+i > no * e that: 

fe' 
p E L{h,k) = ma% e{1 t} u _\uk!__ h\ 

(21) < wa^ 6{w} fc/ _ A) 

fe fe 

~*-(^-£)~ fc -fc + £ 
Then we obtain: 
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(22) P -^ 1 <^^r = l + ^^ = l + ^-<l + r 
*4+l -k-h + ^k k-h + % k +k ^ k 

To prove that pEL(h, k) coincides with k _ h , 1 for h > k—yk let us rewrite 
h and k' as h = k — a and k' = k — b, with a > b and a, b G Z^~. We obtain: 



/t — 6 
PEl(h > (k — vk), k) = max 



k-b 



max 



(23) 



^fe>a>6 k - b - [k - (a + b) + f - 1 + f J 



max 



a(6+l) 



Vk>a>b k-b-k + (a + b) + l- [ — - 

k k 

< max 



Vk>a>b k-h + 1- L^f^J k-h + 1 

where the last equality follows from the fact that, since a = k — h < yk 
and b<a-l<Vk-l, then a(b + 1) < k. D 

The complex expression of pEi(h,k) is in part due to the "rounding" of 
the memory capacity of the optimal offline algorithm. However, it is impor- 
tant to note that this rounding is not sufficient to explain why pEL(h, k) can 
be strictly larger than the "classic" ratio k _ h+i obtained when capacity is 
fixed at its maximum value: at smaller capacities rounding can only favour 
the online algorithm, and for any fixed ratio jj, k ,_ h , +1 strictly decreases 
with k' , again favouring the online algorithm at smaller capacities. Capac- 
ity fluctuations (rather than simply the choice between different, constant 
capacities) are then the source of the separation between pEi{h,k) and the 
"classic" (h, /^-competitive ratio k _ h+l - 

6. Decoupling replacement from capacity in RAM rental 

The results from section U] can be readily applied to the RAM rental 
problem, in which a paging algorithm ALG can choose the capacity sequence 
(with maximum capacity A;), and the cost it incurs and must minimize on a 
request sequence a is: 

M 
(24) R k ALG {a) = Y J ^f(i) + Pw{i)) 

i=l 
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where w(i) is the capacity when serving the i request of a, and f(i) is 1 
if that request is a fault and otherwise. The fundamental consequence of 
our results from section [5] is that to a large extent the replacement policy 
can be decoupled from the choice of capacities. More precisely, theorem U] 
yields: 

Corollary 1. Consider a paging algorithm ALG, servicing each request ai 
of a sequence a with capacity w(i) < h and an arbitrary (even offline) re- 
placement policy; and a second paging algorithm ALG' servicing o~i with 
capacity 2w(i) and a replacement policy that can be any marking or dynam- 
ically conservative algorithm. Then, for any choice of a, (3 and w(-) < h: 

(25) Rf LGI {a)<2-R h ALG {a). 

which follows immediately from the fact that the sum of all faults incurred 
by ALG' is at most twice that by ALG as long as ALG' maintains twice 
the capacity of ALG. In other words, RAM rental is all about choosing 
the correct capacity at any given time; and any of the "classic" replacement 
policies analysed in the previous section will be close to optimal for any 
choice of a, of j3, and of the capacity sequence. 

7. Conclusions 

Good performance in the case of constant memory capacity provides no 
performance guarantees whatsoever in the case of fluctuating memory capac- 
ity: moving from a scenario where capacity remains constant to one where it 
can fluctuate by a single page can mean the difference between performance 
optimal within a factor 2, and performance suboptimal by an arbitrarily 
large factor. This suggests the need of extreme caution when evaluating 
with classic methodologies the performance of paging algorithms meant for 
memory systems with dynamic capacity. 

A counterpoint to this very "negative" result is that several extremely sim- 
ple classic paging algorithms achieve optimal or nearly optimal performance 
even in the dynamic capacity framework. This is particularly surprising 
because none of these algorithms is designed to take memory capacity fluc- 
tuations into explicit account: counterintuitively, while knowledge of future 
page requests provides an advantage, knowledge of future memory capacity 
does not. A practical corollary is that, in the design of memory architec- 
tures, one can then efficiently decouple the problem of allocating memory 
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resources to different cores/processes/threads from the problem of managing 
the allocated memory - greatly simplifying system design and analysis and 
providing a strong (a posteriori!) theoretical justification for the exokernel 
approach [TT] . 

As in classic paging, in the dynamic capacity framework competitive anal- 
ysis fails to distinguish between the performance of LRU, of FIFO, and of 
more naive algorithms such as RAND or FWF - at least without resort- 
ing to more sophisticated approaches such as access graphs. While each of 
these algorithms is still guaranteed to outperform an optimal offline algo- 
rithm (and thus any other online algorithm) whose memory system has half 
the capacity and twice the access cost, there are probably differences within 
those factors of 2 that would be important to characterize in practice. It 
is by no means clear whether the winner in the dynamic capacity scenario 
would be the same as in the classic one, or whether models designed a pos- 
teriori to explain the superiority of e.g. LRU over FIFO would still provide 
correct predictions. 

In this sense we are not aware of any experimental benchmarks specifically 
designed to assess the impact of memory capacity fluctuations. A fundamen- 
tal obstacle in their development seems to be the difficulty of characterizing 
"typical" fluctuation patterns encountered in practice. An interesting line 
of inquiry would be to investigate whether one can obtain, from the perfor- 
mance numbers of a black box algorithm under a small "basis" of specific 
fluctuation patterns, sufficient information to compute a good assessment of 
the algorithm's performance numbers under any other pattern. 

Finally, the "dynamic resources" approach is not necessarily restricted 
to paging. There are a number of other problems where the amount of 
resources available for a task can realistically vary over time. Examples 
include call admission p3] (with variable circuit capacity) and the numerous 
variants of online scheduling [23J (with e.g. variable number or speed of 
servers). In addition to studying each problem individually, it would be 
extremely interesting to identify broad classes sharing similar characteristics. 
For example, which problems can be solved optimally or almost optimally 
without knowledge of the amount of resources available in the future (as in 
the case of paging with dynamic memory capacity)? 
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